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Approximating Bin Packing within 
0(log OP T • log log OP T) bins 

Thomas Rothvofi* 
Massachusetts Institute of Technology 

Abstract 



For bin packing, the input consists of n items with sizes si,...,s n £ [0,1] which have 
to be assigned to a minimum number of bins of size 1. The seminal Karmarkar-Karp 
algorithm from '82 produces a solution with at most OPT + 0(log 2 OPT) bins. 

We provide the first improvement in now 3 decades and show that one can find a 
solution of cost OP T + 0(log OP T ■ loglog OP T) in polynomial time. This is achieved by 
\ rounding a fractional solution to the Gilmore-Gomory LP relaxation using the Entropy 

C/5 ■ Method from discrepancy theory. The result is constructive via algorithms of Bansal and 

, Q | ' Lovett-Meka. 

(N . 

> . 1 Introduction 

O i 

Bin Packing is one of the very classical combinatorial optimization problems studied in com- 
puter science and operations research. It's study dates back at least to the 1950's IEis57l and 
it appeared as one of the prototypical NP-hard problems in the book of Garey and Tohn- 
son |GJ79|. For a detailed account, we refer to the survey of I CGJ84 1 . Bin Packing is also 
a good case study to demonstrate the development of techniques in approximation algo- 
rithms. The earliest ones are simple greedy algorithms such as the First Fit algorithm, an- 
alyzed by Johnson |Joh73| which requires at most 1.7 • OPT + 1 bins and First Fit Decreas- 
ing lJDU + 74| , which yields a solution with at most ^-OPT + 4 bins (see ID6s07l for a tight 
bound of ^-OPT + |). Later, Fernandez de la Vega and Luecker lFdlVL81l developed an 
asymptotic PTAS by introducing an item grouping technique that reduces the number of dif- 
ferent item types and has been reused in numerous papers for related problems. De la Vega 
and Luecker were able to find a solution of cost at most (1 + e)OPT+ O(p-) for Bin Packing 
and the running time is either of the form 0(n^ (E) ) if one uses dynamic programming or of 
the form 0{n ■ f{e)) if one applies linear programming techniques. 

A big leap forward in approximating bin packing was done by Karmarkar and Karp in 
1982 |KK82|, who provided an iterative rounding approach for the mentioned linear pro- 
gramming formulation which produces a solution with at most OPT + 0(log 2 OPT) bins in 
polynomial time, corresponding to an asymptotic FPTAS. 
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Both papers |FdlVL81, KK82| used the Gilmore-Gomory LP relaxation (see e.g. |Eis57, 
IGG6D ) 

min{l r x|^x = l,x>0} (1) 

where A is the pattern matrix that consists of all column vectors {p e Z" \ p T s < I}. Each 
such column p is called a (valid) pattern and corresponds to a feasible multiset of items that 
can be assigned to a single bin. Note that it would be perfectly possible to consider a stronger 
variant in which only patterns p e {0, 1}" are admitted. In this case, the LP Q) could also be 
interpreted as the standard (Unweighted) Set Cover relaxation 

minj£x s | £ x s > 1 Vi £ [n];x s > VS £ Sf\ (2) 

Se5^ SeSf-.ieS 

for the set system SP :- {S £ [n] | T.ies s i — !}• However, the additive gap between both ver- 
sions is at most O(logn) anyway, thus we stick to the matrix-based formulation as this is 
more suitable for our technique^ . 

Let OPT and OPTf be the value of the best integer and fractional solution for {TJ re- 
spectively. Although {TJ has an exponential number of variables, one can compute a basic 
solution x with l T x < OPTf + 8 in time polynomial in n and 1/5 |KK82| using the Grotschel- 
Lovasz-Schrijver variant of the Ellipsoid method | GLS81 1 . Alternatively, one can also use the 
Plotkin-Shmoys-Tardos framework |PST95| or the multiplicative weight update method (see 
e.g. the survey of |AHK12 |) to achieve the same guarantee. 

The Karmarkar-Karp algorithm operates in logn iterations in which one first groups the 
items such that only |Eie[n] S; many different item sizes remain; then one computes a ba- 
sic solution x and buys [x p \ times pattern p and continues with the residual instance. The 
analysis provides a 0(log 2 OP T) upper bound on the additive integrality gap of {TJ . In fact, it 
is even conjectured in |ST97| that {TJ has the Modified Integer Roundup Property, i.e. OPT < 
\OPTf] + 1 (and up to date, there is no known counterexample; the conjecture is known 
to be true for instances that contain at most 7 different item sizes ISS09I ). Recently, IEPR11I 
found a connection between coloring permutations and bin packing which shows that Beck's 
Three Permutations Conjecture (any 3 permutations can be bi-colored with constant discrep- 
ancy) would imply a constant integrality gap at least for instances with all item sizes bigger 
than i. Note that the gap bound of the Karmarkar-Karp algorithm is actually of the form 
OQogOPiy • log(maXjj{^})), which is O(logn) for such instances. But very recently New- 
man and Nikolov |NNN12 | found a counterexample to Beck's conjecture. 

Considering the gap that still remains between upper and lower bound on the additive 
integrality gap, one might be tempted to try to modify the Karmarkar-Karp algorithm in order 
to improve the approximation guarantee. From an abstract point of view, |KK82| buy only 
patterns that already appear in the initial basic solution x and then map every item to the slot 
of a single larger item. Unfortunately combining the insights from |NNN12| and IEPR11I . 
one can show that no algorithm with this abstract property can yield a o(log 2 n) gap, which 
establishes a barrier for a fairly large class of algorithms | EPR13 1 . 

A simple operation that does not fall into this class is the following: 



^For example, if the input consists of a single item of size p then the optimum value of is 1, while the 
optimum value of (T) is i. But the additive gap can be upper bounded as follows: Take a solution x to (T] and 
apply a single grouping via Lemma[6]with parameter ji = 1. This costs Oflog n) and results in a solution to (2) for 
some general right hand side vector b e Z" Q . With the usual cloning argument, this can be easily converted into 
the form with right hand side 1. 
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Gluing: Whenever we have a pattern p with x p > that has many copies of the 
same item, glue these items together and consider them as a single item. 

In fact, iterating between gluing and grouping, results in a mapping of several small input 
items into the slot of a single large item - the barrier of |EPR13| does not hold for such a 
rounding procedure. 

But the huge problem is: there is no reason why in the worst case, a fractional bin packing 
solution x should contain patterns with many items of the same type. Also the Karmarkar- 
Karp rounding procedure does not seem to benefit from that case either. However, there is 
an alternative algorithm of the author | Rotl2l to achieve a 0(log 2 OP T) upper bound, which 
is based on Beck's entropy method |Bec81, Bec88, Spe85J (or partial coloring lemma) from 



discrepancy theory. This is a subfield of combinatorics which deals with the following type of 
questions: given a set system S\,...,S n £ [m], find a coloring of the elements 1,..., m with red 
and blue, such that for each set S, the difference between the red and blue elements (called 
the discrepancy) is as small as possible. 



2 Outline of the technique 

The partial coloring method is a very flexible technique to color at least half of the elements 
in a set system with a small discrepancy but the technique is based on the pigeonhole prin- 
ciple — with exponentially many pigeons and pigeonholes — and is hence non-constructive 
in nature^. But recently Bansal IBanlOl and later Lovett and Meka ILM12I provided polyno- 
mial time algorithms to find those colorings. In fact, it turns out that our proofs are even 
simpler using the Lovett-Meka algorithm than using the classical non- constructive version, 
thus we directly use the constructive method. 

The constructive partial coloring lemma 

The Lovett-Meka algorithm provides the following guarante^l 

Lemma 1 (Constructive partial coloring lemma ILM12I ). Let x e [0, 1] m be a starting point, 
8 > an arbitrary error parameter, V\,...,v n EQi m vectors and Ai X n > parameters with 

f>- A ? /16 ^. (3) 

!=1 ib 

Then there is a randomized algorithm with expected running time 0( (m g 2 n) log(-^)) to com- 
pute a vector y e [0, 1] m with 



2 The claim of the Partial coloring lemma is as follows: Given any vectors v\,... ,v n eR m with a parameters 



X\ , . . . , X n > satisfying 



m . . |9e- A/5 ifA>2 



Y GikA < — , for G(X):=, 

iSi 5 j l0 g 2 (32 + §4) ifA<2 



Then there is a partial coloring % '■ [m] — > {0, ±1} with |supp(^)| > ^ and \ vix\ £ A;||y,-||2 for all vectors i = l,...,n. 

3 The original statement has x,ye [-l,l] m and|yy| > 1-5 for half of the entries. However, one can obtain our 
version as follows: Start with xe [0,1] m . Then apply ILM12 to x' := 2x-l e [-1,1] m to obtain y'e [-1,1] m with 
\Vi{y' - x'} \ < A; || V; || 2 for all i e [n] and half of the entries satisfying \ y'.\ > 1 - S . Then y := i(y' + 1) has half of 

the entries yj e [0, |] U [1- f ,1]. Furthermore, |f,(y-x)| = \ \viiy' -x')\ < V/ \\z- 
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• yj e [0, 5] u [1 - 8, 1] for at least half of the indices j E {1, . . . , m] 

• | Viy— Vix\ < Xi • || V1W2 for each i e {1, ...,n}. 

If we end up with an almost integral Bin Packing solution y, we can remove all entries 
with yj < j- and roundup those with yj e [1 - j-, 1] paying only an additional constant term. 
Thus we feel free to ignore the 8 term and assume that half of the entries are yj e {0, 1}. 

The algorithm in |LM12| is based on a simulated Brownian motion in the hypercube 
[0, 1] m starting at x. Whenever the Brownian motion hits either the boundary planes yj = 
or yj = 1 or one of the hyperplanes Vi(x-y) = +A; || Vj H2, the Brownian motion continues the 
walk in that subspace. By standard concentration bounds, the probability that the walk ever 
hits the ith hyperplane is upperbounded by e"^ 1 ;'. In other words, condition CD says that 
the expected number of hyperplanes Vi(x-y) = +A; || V1W2 that ever get hit is bounded by j|, 
from which one can argue that a linear number of boundary constraints must get tight. 

Readers that are more familiar with approximation algorithm techniques than with dis- 
crepancy theory, should observe the following: In the special case that X, ■ = for all i, one 
can easily prove Lemmafflby choosing y as any basic solution of {y | vty = VixMi e In]; < 
y < 1}. In other words, Lemma[T]is somewhat an extension of the concept of basic solutions. 
Considering that a significant fraction of approximation algorithms is based on the sparse 
support of basic solutions, one should expect many more applications of ILM121 . 

The rounding procedure 

Let x be a fractional solution for the Gilmore-Gomory LP CO, say with |supp(x)| = m<n and 
let A be the constraint matrix reduced to patterns in the support of x. 

Assume for the sake of simplicity that all items have size between 4 and | for some k. We 
now want to discuss how LemmaCQcan be applied in order to replace x with another vector y 
that has half of the entries integral and is still almost feasible. Then repeating this procedure 
for log(m) iterations will lead to a completely integral solution. For the sake of comparison: 
the Karmarkar-Karp algorithm is able to find another fractional y that has at most half the 
support of x and is at most an additive 0(1) term more costly. So let us argue how to do 
better. 

Let us sort the items according to their sizes (i.e. | > s\ > . .. > s n > j-) and partition the 
items into groups I\,...,I t such that the number of incidences in A is of order lOOfc for each 
group. In other words, if we abbreviate vi } := Lie/, A' a s the sum of the row vectors in Ij, 
then || vij || 1 w 100A;. Since each column of A sums up to at most k and each group consums 
lOOfc incidences, we have only t < many groups. Now, we can obtain a suitable y with at 
most half the fractional entries by either computing a basic solution to the system 

v Ij {x-y) = Vj e[t], l T y=l T x, 0<y<l 

or by applying the Constructive Partial Coloring Lemma to vj l ,...,vj t and f bj := (1,...,1) 
with a uniform parameter of X := 0. In fact, since (f + 1) • e _ ° 2/16 < M. + 1 < ™ condition CD 
is even satisfied with a generous slack. The meaning of the constraint vj{x- y) = is that 
y still contains the right number of slots for items in group /. But the constraint does not 
distinguish between different items within /; so maybe y covers the smaller items in / more 
often than needed and leaves the larger ones uncovered. However, it is not hard to argue that 
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after discarding lOOfc items, y can be turned into a feasible solution, so the increase in the 
objective function is again 0(1) as for Karmarkar-Karp. 

Now we are going to refine our arguments and use the power of the entropy method. 
The intuition is that we want to impose stronger conditions on the coverage of items within 
groups. Consider a group / := Ij and create growing subgroups Gi £ G2 E ... £ Gn £ = I 
such that the number of incidences grows by e • 100/c from subgroup to subgroup, for some 
£ > (later £ := , \ will turn out to be a good choice; see Figure [Tl (a)). In other words, 

log n 

II v Gj+i\Gj 111 ~ £-100fc. We augment the input for LemmaQ]by the vectors v G for all subgroups 

equipped with parameter Xq := 4yln(i). Observe that condition © is still satisfied as each 

of the I many subgroups G contributes only e~ A G /16 < e . So, we can get a better vector y 

that also satisfies \vg(x- y)\ < 4^/ln(~) • || fell 2 f° r an Y subgroup. In order to improve over 
our previous approach we need to argue that || VqWi « k- But we remember that by defini- 
tion || Vg\\\ ^ lOOfc, thus we obtain || Vc\h ^ vli v g\\\ • II ^gIIoo - lOOfc- \/\\ vg\\oo/\\ vg\\i- m other 
words, the only situation in which we do not immediately improve over Karmarkar-Karp is 
if II vgWoo ^ £2(11 VgWi), i.e. if there is some pattern such that a large fraction of it is filled with 
items of the same subgroup G. 




q items 



(a) rounding [b) gluing 

Figure 1: (a) visualization of groups and subgroups, (b) visualization of the gluing procedure. 



The gluing 

At this point our gluing operation comes into play. After a simple pre-rounding step which 
costs us a o(l) term, we can assume that all entries in x are multiples of \ := , \ . Recall 

r q log n 

that initially we have a single copy from each item. We group consecutive items together 
into groups of size f> = lo ^ n and round their sizes to the smallest one in the group. By stan- 
dard arguments this incurs a negligible cost of 0( , \ ). Now we can assume that we have a 

o log n 

sufficiently large number of copies for every item. Suppose that after this agglomeration we 
find an item i and a pattern p in the support such that indeed pi is large, say p,-s,- > ^-4— . 

log n 

The crucial observation is that this pattern p alone covers w := L^-J many copies of item i in 
the input since x p ■ pi > w. Next, take w many copies of item i in p and glue them together 
to obtain a new, bigger item V of size v = w-si. The pattern p has enough items to do this 
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q times, see Figure [TJ(fo). In other words, the modified pattern now contains q copies of a 
new artificial item V . The reason why we want q copies of this new item is that the modified 
pattern p alone covers q ■ x p = 1 copies of i' . Thus, in a finally obtained integral solution we 
would have a slot for the artificial item i', which we can then replace with the copies of the 
original item i. 

Observe that the size of this newly obtained item type is s,< = w- si> i 2 - . So we call 
items above that size large and below that size small. The interesting effect is that if we ap- 
ply this gluing procedure to all small items whenever possible, the penally that we pay for 
rounding the remaining small items is so small that the overall cost is completely dominated 
by the contribution of the large items (i.e. those items that were either large from the be- 
ginning or that were created during the gluing process) . In other words, we obtain the same 
approximation guarantee as if the instance would only contain items of size at least i 2 ~ 
from the beginning on; for those instances already |KK82| produces a solution with at most 
OPTf + O(logn-loglogn) bins, so this is our final approximation guarantee for all instances. 



Contribution 

Our main contribution is the following theorem: 

Theorem 2. For any Bin Packing instance Si,..., s n e [0, 1], one can compute a solution with 
at most OPTf + O (log OP Tf - log log OP Tf) bins in expected time 0(n 6 log 5 (n)), where OPTf 
denotes the optimum value of the Gilmore-Gomory LP relaxation. 

This partly solves problem #3 in the list of 10 open problems in approximation algo- 
rithms stated by Williamson and Shmoys | WS1 1 1 (they asked for a constant integrality gap) . 



3 Related work 

The classical application of the partial coloring lemma is to find a coloring % '■ [m] — ► {+1} for 
m elements such that for a given set system Si,..., sjfl the discrepancy max; £ [„] \ T.jeSi XC/)I 
is minimized. For example, one can obtain Spencer's bound |Spe85 | on the discrepancy of 
arbitrary set systems, by applying log m times Lemma[T]starting with x := {\, . . . , \) and a uni- 
form bound of A := C^log^ where Vf e {0, l} m is the characteristic vector of S;. This results 

in a coloring % : [m] — {±1} with I j(S)| < 0[y mlog^). Note that e.g. for n < O(m), this is a 

0{\/7n) coloring, while a pure random coloring would be no better than 0( \Jm-\ogm). 

Other applications of this method give a 0{\/tlog m) bound if no element is in more than 
t sets |Sri97| and a 0(\Zfclogm) bound for the discrepancy of k permutations |SST|. For the 
first quantity, alternative proof techniques give bounds of 2 t—l |BF81| and 0{\J f-logm) IBan98l . 

In fact, we could use those classical techniques and extend I Rotl2l to obtain a OPTf + 
0{logOPTf ■ loglogOPTy) integrality gap result. It might appear surprising that one can 
bound integrality gaps by coloring matrices, but this is actually a well known fact, which is 
expressed by the Lovasz-Spencer-Vesztergombi Theorem |LSV86|: Given a matrix A and a 

4 The standard notation in discrepancy theory is to have n as number of elements and m as the number of 
sets. However, that conflicts with the standard notation for Bin Packing, where n is the number of items which is 
essentially the number of (/-vectors. 
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vector x e [0, l] m such that any submatrix of A admits a discrepancy a coloring. Then there 
is a y e {0, \} m with || Ax - AyW^ < a. For a more detailed account on discrepancy theory, we 
recommend Chapter 4 in the book of Matousek IMat99l. 

4 Preliminaries 

In the Bin Packing literature, it is well known that it suffices to show bounds as in Theorem[2] 
with an n instead of OP TV and that one can also assume that items are not too tiny, e.g. 
s,- > i. Though the following arguments are quite standard (see e.g. |KK82|), we present 
them for the sake of completeness. 

Lemma 3. Assume for a monotone function f , there is a poly{m) -time OPTf + f{n) algorithm 
for Bin Packing instances s e [0, l] m with |{Sj | i e [m]}\ < n many different item sizes and 
min{s,- | i e [m]} > -. Then there is a polynomial time algorithm that finds a solution with at 
mostOPTf + f{OPTf) + 0{logOPT f ) bins. 

Proof. Let 5 e [0, l] m be any bin packing instance and define a := L as their size. First, 
split items into large ones L := {i e \m\ \ s\ > \\ and small ones S := [i e [m] | < -}. 

We perform the grouping procedure from |KK82| (or from Lemma [6} to large items L 
and produce an instance with sizes 5' such that each size s'. that appears has T.j-. s '=s'. s 'i — 1- 

Moreover, after discarding items of total size at most 0(log gg^jqfgn ) ^ O(logfj) one has 
OPT'^ < OPTf. Thus the number of different item sizes in s' is bounded by a. We run the as- 
sumed algorithm to assign items in L to at most OPT^ + f{a) < OPTf + f {OPTf) bins (using 
that OPTf > a and / is monotone). Adding the discarded items increases the objective func- 
tion by at most another 0(log OP Tf) term. Now we assign the small items greedily over those 
bins. If no new bin needs to be opened, we are done. Otherwise, we know that the solution 
consists of k bins such that k - 1 bins are at least 1 - ^ full. This implies a > (k- 1) • (1 - 
and hence k < a + 3 < OP Tf + 3 assuming a > 2. □ 

From now on, we have the implicit assumption Si > ^. In an alternative Bin Packing 
definition, also called the cutting stock problem, the input consists of a pair {s, b) such that 
bi £ Z>o gives the number of copies of S{. If b is unary encoded, this problem version is 
polynomially equivalent to classical Bin Packing. Otherwise, it is even unknown whether for 
n = 3 the problem admits a polynomial time algorithm (for n = 2 it can be solved in poly- 
nomial time and for any constant n, there is a polynomial time OPT + 1 algorithm |JSO10|). 
Moreover one cannot expect to even solve the LP Q) up to an additive constant in polyno- 
mial time. But the Karmarkar Karp bound of 0(log 2 n) on the additive integrality gap still 
holds true (when n is the number of item types). In this paper, we will work with a more 
general formulation in which any b e cone{p e Z" | s T p < 1} may serve as vector of multi- 
plicities (note that such a vector might have fractional entries) . From our starting solution x, 
we can immediately remove the integral parts [x p \ and assume that < x < 1, which has the 
consequence that L" =1 Sibi < n. 

It will be useful to reformulate bin packing as follows: consider a size vector s e [0, l] n 
(si > . . . > s n ) with pattern matrix A and a given vector x e M™ as input and aim to solve the 



7 



following problem 

minl T y (4) 

y e 

We write y > x if L;<; A/y > L/<z AjX for all z' £ [n]. In words: we have a fractional solution 
x to LP {TJ for an instance with A,x many items of type i in the input and aim to find an 
integral solution y that reserves T.j<i Ajy many slots for items of type 1, . . . , i. The condition 
y> x guarantees that y can be easily transformed into a feasible solution by simply assigning 
items to slots of larger items. We make the following observation: 

Observation 1. Consider any instance 1 > Si > ... > s n > with pattern matrix A and vector 
x e R™ such that Ax = 1. Then the value of the optimum integral solution to I0J and QJ 
coincide. 

However, Q] has the advantage that we can split the solution x = [x', x") and then sep- 
arately consider x' and x" while the vector h' = Ax' might be fractional, which is somewhat 
unintuitive when speaking about classical bin packing. When Ax e Z" and y with y > x is 
integral, then it is clear that y defines a solution in which each item represented by multi- 
plicity vector Ax can be mapped to one slot in the patterns of y. However, if Ax is fractional, 
this assignment might be fractional as well. In the following, we discuss the properties of 
such a mapping. 

Consider a vector x e M™ Q with a candidate solution y e Z™ Q . We define a bipartite supply 
and demand graph G = {U u V, E) with demand nodes U = {u\,. . . , u n } on the left and supply 
nodes V = {v\,...,v„} on the right. Demand node w; has a demand of dem(z') := Ax, and i/j 
has a supply of sup(z') := A/y. Moreover, we have an edge («,-, Vj) e E for all i > j. We should 
emphasize that it is crucial that the items are sorted according to their sizes. 




We state the following very easy lemma (as usual N{U') := {v e V | 3u e U' : {u, v) e E} is 
the set of neighbours of [/')• 

Lemma 4. Let any instance 1 > si > . . . > s n > with vectors (x, y) e W™ xZ™ be given. Then 
the following is equivalent: 

a) x<y 

b) sup({l,...,z'}) > dem{{l,...,i}) Vz e [n] 
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c) sup{N{U'))>dem(U')VU' 

d) The following transportation problem 



E Z iJ 



= dem{Ui) Vz e [n] 




< sup(fy) Vj £ [n] 



> V{i,j)aE 



has a (fractional) solution z. 

Moreover, there is always an optimum integral solution y to (4} with assignment z satisfying 
ISJ such thatza > LA-xJ for all i £[n]. 

Proof. The equivalence of a) and fo) is by definition. Hall's theorem provides that c) and d) 
are equivalent. To seethe equivalence of b) and c), let i* := max{z e U'}, thendem({l,. .. , /*}) > 
dem(CT) > sup(iV([/')) = sup({i/i . . . , Vf}). 

It remains to show the "moreover" part. Let y > x with y e Z™ be given. We group the 
fractional parts of dem(l),... ,dem(n) to obtain a new demand function with dem' ({ 1,... , i}) = 
[dem({l, . . . , z})l and dem'(z) e Z for all i e [n] . Since sup(j') is integral, condition b) is still sat- 
isfied and Hall's Theorem yields a solution z e Zf Q . Now consider the solution y for 10}. After 
replacing patterns with copies, we can assume that y p e {0, 1}. For each j < i, we replace Zij 
items of type j by items of type i (note that Sj < Sj). This results in an integral vector y' that 
satisfies the claim. □ 



To fix some notation, p denotes a pattern which we interpret either as a multi-set of items 
or as a vector where p,- e Z>o denotes the number of copies of item i contained in p. The 
matrix formed by all possible patterns is denoted by A. Moreover A,- is the zth row of A 
and by a slight abuse of notation, sometimes we interpret p as a column index and write 
A p as the pth column. As usual [n] = {1, . . . , n} and 1 denotes the all-ones vector of suitable 
dimension. For a subset I Q [n], we write s{I) '■= Y.iEi Si- For any k that is a power of 2, we 
denote the subset of items {i £ [n] | 4 < s; < |} as one size class. The quantity m will usually 
refer to the number of patterns in A. 

5 Operations on fractional solutions 

We introduce two useful operations that we can apply to a fractional solution: the classi- 
cal item grouping procedure similar to I FdlVL8l] |KK82l and a novel item gluing operation. 
Finally we show how they can be combined to obtain a well spread instance in which no 
pattern contains a significant fraction of copies of a single item. 

In order to keep the maintained solution feasible in these procedures it will be necessary 
to add some additional patterns. In the classical literature IFdlVL8ll IKK82I this would be 
done with the phrase "discard the following set of items. . . " meaning that those items are 
assigned to separate bins in a greedy manner. We choose to handle this slightly differently. 
We allow additional columns in A - for each i e [n], we add a waste pattern {i}, which can be 



Notation 
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bought in arbitrary fractional quantities at cost 2s ; per copy. For a vector x e R™ represent- 
ing a fractional solution, we write 

{1,2S) T X= Y, X P + 2s i x {i) 

p regular pattern {i} waste pattern 

as objective function. During our rounding algorithm, we do not make any attempt to round 
entries jc/,-j belonging to waste patterns to integral values. This can be easily done at the very 
end as follows: 

Lemma 5. Let x e M™ and suppose that \p\ = 1 for all p e supp[x) . Then there is an integral 
y>x with s T y < s T x+ 1. 

Proof. By adding dummy copies, we may assume that x„ = \ for all p (for some large num- 
ber a). Sort the patterns p\,...,pq such that the item sizes in those patterns are non-increasing. 
Buy each gth pattern starting with p q plus one copy of p\ . □ 

Finally, any set of Bin Packing items S £ [n] can be assigned to at most 2£ ieS S; + 1 bins 
using a First Fit assignment, which is the reason for the penalty factor of 2 for waste patterns. 

5.1 Grouping 

The operation of grouping items is already defined by de la Vega and Luecker in their asymp- 
totic PTAS for Bin Packing |FdlVL81|. For some parameter k, they form groups of k input 
items each and round up the item sizes to the size of the largest item in that group. This es- 
sentially reduces the number of different item types by a factor of k. In contrast, we will 
replace items in the fractional solution x with smaller items. The reason for our differ- 
ent approach is that we measure progress in our algorithm in terms of |supp(x)|, while e.g. 
Karmarkar-Karp measure the progress in terms of the total size of remaining input items. As 
a consequence we have to be careful that no operation increases |supp(x) |. 

Lemma 6 (Grouping Lemma). Let x e U™ be a vector, /3 > any parameter and a S Q [n] 
be a subset of items. Then there is an x' > x with identical fractionality as x (except of waste 
patterns) with{l,2s) T x' < (l,2s) r x+0(/Mog(2maX;j £ s{j;}), andforanyi e S, either A,-x' = 
orsiAix'>/3. 

Proof. It suffices to consider the case in which a < s,- < 2a for all i e S and show that the 
increase in the objective function is bounded by 0(/3). The general case follows by applying 
the lemma to all size classes S n {i e [n] | {\) e+l < < i\) e }. We also remove those items that 
have already s; A,-x > /3 from S since there is nothing to do for them. 

We consider the index set / := {(z, p) | i e S,p e supp(x)}. For any subset G £ /, we define 
the weight as w[G) := T.(i,p)eG s iPi x p- Note that any single index has weight w{{{i,p)}) = 
sipiXp < SiAtx < f> by assumption. Hence we can partition / = G\ u . . . 0G r such that 

• it/(Gjfc)e[2j6,4j8] Vfc = l,...,r-1 

• w{G r )<2p 

• U,p) e G k ,{i',p') e G fc+1 ^> i < i' 
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Now, for each k e {1, . . . , r - 1} and each index (z, p) e G^, we replace items of type i in p with 
the smallest item type that appears in G^. Furthermore, for indices [i,p) e G r , we remove 
items of type i from p. Finally, we add 4^ many copies of the largest item in / to the waste 

(note that the number 4^ can be fractional and even 4^ « 1 is meaningful). Let x' denote 
the emerging solution. Clearly, x' only uses patterns that have size at most 1. Moreover, 
(l r ,2s)x'-(l r ,2s)x<4^-2max{s ; - 1 z' e S} < 16/3. 

It remains to argue that x' > x. Consider any item i e [n] and the difference Ajx' - 
Y.j<i AjX. There is at most one group G^ whose items were (partly) larger than i in x and then 
smaller in x' . The weight of that group is w{Gt) < 4/3, thus their "number" is T.u,p)eG k Pi x p — 

We add at least this "number" of items to the waste, thus 



5.2 Gluing 

We now formally introduce our novel item gluing method. Assume we would a priori know 
some set of items which in an optimal integral solution is assigned to the same bin. Then 
there would be no harm in gluing these items together to make sure they will end up in the 
same bin. The crucial point is that this is still possible with copies of an item i appearing in 
the same pattern in a fractional solution as long as the contribution x p ■ pi to the multiplicity 
vector is integral, see again Figured] (fo). 

Lemma 7 (Gluing Lemma). Suppose that there is a pair of pattern p and item i with x v = ■£ 
and pi > w - q (r,q,weN)as well as a size sp = w- s z - . Modify x such that w ■ q items of type 
i in pattern p are replaced by q items of type i' and call the emerging solution x' . Then the 
following holds: 

a) The patterns in x' have still size at most one and 1 T x = 1 T x' . 

b) Any integral solution y' > x' can be transformed into an integral solution y>x of the 
same cost. 

Proof. The first claim is clear as q- Sf = wq- 5, . 

Now, let y' > x' be an integral solution and let z' satisfy the linear system (5} w.r.t. x' and 
y' . Then Ayx' > jj ■ q = r e Z>o- By Lemma[4]we may assume that there are r slots for item i' 
in patterns in supp(y). In each of those r pattern p, we resubstitute i' by w copies of i (recall 
that this is size preserving as Sj/ = w-sfj. □ 

Observe that since we allow Ax to be fractional, it is crucial that the gluing operation 
produces integer multiples of an item so that (using the "moreover" part in Lemma |4) we 
can resubstitute an integer number of items. 

Any sequence of grouping and gluing produces a solution which dominates the origi- 
nal instance in the sense that any integral solution for the transformed instance implies an 
integral solution for the original one. 




[i,p)e.G k 



□ 
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Corollary 8. Let s e [0, 1] m and x e M™ be any instance for (U). Suppose there is a sequence 
x = x (0 \ . . . , x (r) with x tr) e Z™ such that for each t e {1, .. . , T}, at least one of the cases is true: 

• (i) x w > x {t ~ l] 

• (ii)x^ emerges from x (r_1) by gluing items via Lemma\j^ 

Then one can construct an integral solution y e Z™ with y>x andl T y < l r x (r) in polyno- 
mial time. 

Proof. Follows by induction over T, the definition of ">" and Lemma[71b). □ 



5.3 Obtaining a well-spread instance 

As already argued in the introduction, a rounding procedure based on the partial coloring 
method would beat |KK82| if the patterns in p e supp(x) would satisfy pi < 8-Aixtox 8 < o(l), 
i.e. they are 8 -well- spread. A crucial lemma is to show that we can combine grouping and 
gluing to obtain a voly l Q%{n) -well- spread solution while loosing a negligible additive polyl 1 og( „ ) 
term in the objective function. 

To simplify notation, let us assume that the vector 5 contains already all sizes k ■ Si for 
i = \,...,n and k e N (even if x does not contain any item of that size) . 

Lemma 9. Let 1 > s\ > . . . > s n > and x e [0, 1] m be given such that for some q e Z>o one has 
x p e for all p e supp(x) and \supp{x) \<n. Choose any parameters 8, f> > and call items 

of size at least e := 8j^ large and small otherwise. Then one can apply Grouping and Gluing 
to obtain a solution x with [l,2s) T x < (l,2s) r x+0(/31og 2 n) and the property that pi < 8-A{X 
for all small items i and all p e supp(x) . 

Proof. First apply grouping with parameter f> to the small items to obtain a vector x' > x with 
(l,2s) T x'< (l,2s) r x+0(/3-logn) such that Sj • A{ x' > f> whenever A\ x' > 0. Now apply gluing 
for each i and p e supp(x'), wherever sipi >28 ■ /3 with maximal possible w. In fact, that 
means w > Lt^J - ^ since > ^ > 1. The size of the items emerging from the gluing 

process is at least w-Si> thus they are large by definition. We have at most q items of 

type i remaining in the pattern and their total size is q • 5/ < q ■ e < Let x" be the new 
solution. 

If after gluing, we still have SiAix" > j, then we say i is well-covered. If indeed all small 
items are well covered, then we are done because SjAip < Siq < ^ < 8 ■ SiAfx" for all small i 
and p e supp(x"). 

Thus, let S := {i small | i not well covered} be the set of those items whose number has 
decreased to less than half due to gluing. We apply again grouping (Lemma© to S (note that 
we do not touch the well covered items). Then we apply again gluing where ever possible 
and repeat the procedure until all small items are well covered. Note that once an item is 
well covered it stays well covered as it is neither affected by grouping nor by gluing. 

In each iteration the waste increases by 0(/3-log n) , thus it suffices to argue that the proce- 
dure stops after at most 0(log n) iterations. Note that the total size of not well covered items 



12 



Li not well covered Si-A-iX decreases by at least a factor of | in each iteration. Moreover at the be- 
ginning we had L" =1 SiAfX < n and we can stop the procedure when Y,i not well covered Si-A-iX < 
pH, which shows the claim. □ 



6 The algorithm 

In this section, we present the actual rounding algorithm, which can be informally stated as 
follows (we give a more formal definition later): 

(1) FOR log n iterations DO 

(2) round x s.t. x p e ^Jfe for all p 

(3) make X p^^ -well spread 

(4) run the constructive partial coloring lemma to make half of the variables integral 

For the sake of comparison note that the Karmarkar-Karp algorithm |KK82| consists of 
step (1) + (4), just that the application of the constructive partial coloring lemma is replaced 
with grouping + computing a basic solution. 



6. 1 Finding a partial coloring 

The next step is to show how Lemma[T]can be applied to make at least half of the variables 
integral. As this is the crucial core procedure in our algorithm, we present it as a stand- 
alone theorem and list all the properties that we need for matrix A. Mathematically speaking, 
the point is that any matrix A that is column-sparse and has well-spread rows admits good 
colorings via the entropy method. 

Theorem 10. Let x e [0, l] m be a vector and 5,e be parameters with < e < S 2 < 1 and let 
A e Z"^ m (with m > 100log(min;{7"})J be any matrix with numbers 1 > S\ > ... > s n > 0. 
Suppose that for any column p e [m], one has A p s < 1 and for any row i with s,- < e one has 
|| Ai || oo £ 8 • || Ai || i . Then there is a randomized algorithm with expected polynomial running 
time to compute a ye [0, l] m with\{j e [m] | y p e {0, 1}}| > y, l T y = l T x and for all i e [n] 

Si>£ 

Si<£ 

Proof. We will prove the claim via a single application of Lemma[TJ In particular we need 
to make a choice of vectors V[ and parameters A;. First of all, it will be convenient for our 
arguments if each individual row has a small norm. So we can split the rows i for large items 
(i.e. Si > e) so that || Ai \\ \ = 1 and we split the rows for small items so that -X < \\ A{ \\ i < 4 and 
II Ai Hoc = 1. In the following let C > be a large enough constant that we determine later. 

First, we partition the items into groups [n] = hi) . . . i)I t such that each group / consists of 
consecutive items and is chosen maximally such that X II Ai || 1 5,- < C and so that / contains 




5 Infact, whenever we have a pattern p with Xp < i we can just move it to the waste. In total over all iterations 
this does not cost us more than an extra 1 term. Then we always have the trivial lower bound SjAfX > \ as 
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only items from one size class. In other words, apart from the log(min, {-^}) many groups that 
contain the last items from some size class, we will have thatE i£ / HAHi^i sC-1. For each 
group /, whether complete or not, we define a vector vj := L^/A with parameter A/ := 0. 

Now consider a group / that belongs to small items, say the items i e / have size j < 
Sj < | for some k. We form subgroups Gi £ G2 £ ... £ G t (i) = I such that || VG j+1 \Gj 111 e 

[CV6k,2CVdk] (this works out since || Alii < ^ = ^ < < \fBk; again the last subgroup 
G t (/) might be smaller). For each subgroup G, we add the vector vq to our list, equipped with 

error parameter Ag := 4v/ln(|). 

To control the objective function, we also add the all-ones vector f t,j := (1,...,1) with 
A bj := 0. Now we want to argue that LemmaQ] applied to all the vectors vi, vg, v ty defined 
above, provides a solution y that satisfies the claim. The first step is to verify that indeed the 
"entropy condition" (3) is satisfied. As the item sizes for each column sum up to most one, 
we know that Eie[«] II A' 111 s i — m > where m is the number of columns. Each complete group 
hasX/e/ HAIIiSj S C-l, thus the number of groups is t < - c ^ + log{mmi{j : }) < ^ for C large 
enough and each group / contributes e" A / /16 = 1 to {3}. 

Next, consider any group / and let us calculate the contribution just of its subgroups 
G\,...,G t (i) to l[3). The number of I's subgroups is f(-D < A= + 1 < and each subgroup 

G contributes e~ A c /16 = |, thus their total contribution is bounded by A= • | < 1. In other 
words, the total contribution of all subgroups is bounded from above by ™ as well and 
(3} indeed holds and we can apply Lemma [TJ The algorithm returns a vector y such that 

\vi(x-y) \ = for each group / and \vcix-y)\ <4t/ln(|) • || vg\\i for each subgroup G (and of 
course l T x = l T y). 

Finally, consider any item i and suppose it is small with j < Si < j. It remains to show 



that IZ;</-Ay(jc— y)\ < 0(y 51n(|) • k). Now we use that the interval {1,...,/} can be writ- 
ten as disjoint union of a couple of groups + a single subgroup + a small rest. So let t' 
be the index with i e If. Moreover, let G be the (unique) maximal subgroup such that 
G £ {1,..., i}\\Jt"<? If and let R := {1,. .. , z}\(GuUf"<f If) be the remaining row indices. The 
error that our rounding produces w.r.t. i is 



J^Aj(x-y) = v It „(x-y) + v G {x-y) + v R {x-y) 

j<i r"<f' s v ' 





•r 


< 4i /in 






3> 



\V G \\2+ WvrWi 

<2CV8k 



It remains to bound || VgWz- At this point, we crucially rely on the assumption || A < 28 ■ 
|| At || 1. Using this together with Holder's inequality and the triangle inequality we obtain 



II fclb < v 7 !! ^clli • II ^clloo < \\vch-Y. II Alloc < /II vgIIi • 25 £ \\Ai\h = V25\\ v G \h = 0(Vdk) 

and the claim is proven. Note that for large items i we can satisfy | Y.j<i Aj [x- y)\ < 0{y) 
even without using subgroups. □ 

One can alternatively prove TheoremfTolby combining the classical partial coloring lemma 
and the Lovasz-Spencer-Vesztergombi Theorem |LSV86|. Note that the bound of the classi- 
cal partial coloring lemma involves an extra 0(log j) term for A < 2. However, using the 
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parametrization as in |SST| one could avoid loosing a super constant factor. Moreover, in- 
stead of just 2 different group types ("groups" and "subgroups"), we could use an unbounded 

number to save the ^/ln(|) factor. However, this would not improve on the overall approxi- 
mation ratio. 

Recall that for the gluing procedure in Lemma|9]we need the property that the entries in 
x p are not too tiny - say at least polyl 1 og(f;) . But this is easy to achieve (in fact, the next lemma 
also follows from (KK821 or IRotl2l ). 

Lemma 11. Given any instance l>Si>...>s„>^,xe IR™ and parameter y > 0. Then 
one can compute ay> x in expected polynomial time such that all y p are multiples ofy and 
[X,2s) T y< (1,25) T x + 0{y- log 2 n). 

Proof. After replacing x with a basic solution, we may assume that |supp(x)| < n. We write 
x = yx' + yz with z e Z™ and < x' < 1. Now apply logn times Theorem [Tol with 8 = 1 to 
x' to obtain y' e {0,l} m with l T y' < l T x' and \Ej<i Aj{x- y)\ < OQogn- ^-) (if at the end 
of the rounding process, the fractional support goes below 100log(n), we can stop and re- 
move the remaining fractional patterns). Let if be the largest item in size class £. Then for 
£ = 0, ...,log«, we add 0(logn-2^) copies of item if to the waste of y' . Now (1,2s) 1 y' < 
(1, 2s) T x' + 0(log 2 n) and y' > x' . Eventually define y := yy' + yz and observe that y > x, all 
entries y p are multiples of y and (l,2s) r y < (l,2s) r x+ 0(ylog 2 n). □ 

Observe that just applying Lemma [TTJ with y = 1 yields an integral solution with cost 
OPT f + 0{log 2 n). 

6.2 Proof of the main theorem 

It remains to put all ingredients together and show that each of the log n applications of the 
partial coloring lemma increases the objective function by at most OQoglog n). 

Theorem 12. Let 1 > Si > . . . > s n > y- with and x e IR" J Q be given. Then there is an expected 
polynomial time algorithm to compute y>x with 1 T y < 1 T x + 0(log n ■ loglog n) . 

Proof. We choose 8 := B := y := - \ ■ and e := , K* < \yB8. After moving to a basic 

J r ' Tlog ri\ 41og n <s ' r 

solution and buying integral parts of x, we may assume that x e [0, l] m and |supp(x)| < n. 
Recall that if = argmaxjz' | 2" ( ^ +1) < s,- < 2~ e }. We perform the following algorithm: 

(1) FORf=lT01ognDO 

(2) Apply LemmaHUto have all x p being multiples of y. 

(3) Apply Lemma[9]to make x 5-well spread for items of size at most e. 

(4) Apply TheoremQjJIto x to halve the number of fractional entries^! (if |supp(x)| < 
100log(n), just set x p := 1 for all patterns in the support). 

(5) FOR each size class £, add 0(2^) items of if to the waste if si ( > e and 0(t/51n(|)- 
2 ) items if Si e < e. 

6 To be precise, we apply TheoremllOlfor the submatrix of A corresponding to the columns in supp(x). 
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(6) Resubstitute glued items in x and replace waste patterns via a greedy assignment to 
obtain y. 

Let x (r) be the value of x at the end of the rth while loop. Let x (f,2) be the solution x at the end 
of step (2) in the tth iteration. We define (t,5) analogously. Furthermore, x (r,1) be 

x at the beginning of the while loop. 

First of all, observe that x (r,2) satisfies x (f,2) > x (r,1) and x^' 2 ' e yZ>o for all p, by the prop- 
erties of Lemma [TTl The vector x (r,3) emerges from x (f,2) by grouping and gluing and ac- 
cording to LemmaHl it satisfies Ai p < 5 ■ A/x (t,3) < 5 -LpEsupptx^ 3 ') Pi f° r a U P e supp(x tt,3) ) 
and all i with s; < e < \ Finally the conditions of Theorem flOl are satisfied by parame- 
ters 8 and e, thus the extra items bought in step (5) are enough to have x (t,5) > x (f,3) . Note 
that none of the steps (2), (3), (5) increases the number of regular patterns in the support, but 
\{p | xj, t,4) £ {0, 1}}! <\\{p I x{f ,3) t {0, 1}}|, thus x (1 °g n) is indeed integral. 

Hence, by Corollary[8]we know that y will be a feasible solution to the original bin pack- 
ing instance of cost at most (l,2s) r x tlog "' + 1. It remains to account for the increase in 
the objective function. Each application of (2) increases the objective function by at most 
0(y log 2 ri). (3) costs us 0(/31og 2 n) and (4) + (5) increases the objective function by 0(log^ + 

y^lnC 2 ;) log n). In total over log n iterations, 

l T y-l T x< 0(ylog 3 rc) + OQ0 log 3 n) + 0(logn- log i) + 0(\/<51n(§) log 2 n) < O (log n- log log n) 
plugging in the choices for 6, ft, j, e. □ 

Together with the remark from Lemma[3j the approximation guarantee for our main re- 
sult, Theorem[2]follows. Let us conclude with a quick estimate on the running time. Given a 
Bin Packing instance Si,...,s n (with one copy of each item, so n is the total number of items), 
one can compute a fractional solution x of cost l T x < (l+e)OPTf in time 0{(^ + jg) log 5 (-|)) 
with |supp(x)| < n (see Theorem 5.11 in |PST95|). We set e := - and obtain an x with l T x < 
OPTf + 1 in time 0{n e log 5 [n)). It suffices to run the Constructive Partial Coloring Lemma 
with error parameter 8 := ^, which takes time log n j L ) = 0(n 5 ) where il<n- polylog(n) 
is the number of vectors and m < n is the dimension of x. In other words, the running time 
is dominated by the computation of the fractional solution. Finally we obtain a vector y with 
y p e [0, i] u [1 - p 1] and |supp(y)| < n. We move those entries with y p < i to the waste, 
increasing the objective function by at most n • j- = 1 and we roundup those entries with 

7 Remarks 

The obvious question is, how tight is our analysis? Well, suppose that our instance has only 
items of size at least g~jT7io > then it is not clear whether our gluing approach might be ap- 
plicable or not. 

Another interesting observation concerning the application of the Constructive Partial 
Coloring Lemma is the following: recall that we gave vectors v\ for groups and vectors vq 
for all subgroups as input to Lemma [TJ But the solution y returned by that lemma is the 
end point of a Brownian motion and satisfies Pr[|f (x- y)| < A|| 1/H2] < e" n(A ' for every A > 
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and v e R m regardless whether v is known to the algorithm or not. If we choose A G := logn 
(and 5, y, e slightly more generous), then the guarantee \vcix-y)\ < Ac II vg\\ is satisfied for all 
subgroups with high probability anyway and there is no need to include them in the input. 

Moreover, we are not even using the full power of the constructive partial coloring lemma. 
Suppose we had only a weaker Lemma[T]which needs the stronger assumption that for ex- 
ample 52,- (1 + A,)" 10 < S instead of the exponential decay in A. We would still obtain the 
same asymptotic bound of OQog n -loglog n), thus a fine tuning of parameters is not going to 
give any improvement. 

Consider the seemingly simple 3-Partition case in which all n items have size | < s,- < 
\. Both, the approaches of Karmarkar and Karp |KK82| and ours provide a O(logn) upper 
bound on the integrality gap. The construction of Newman and Nikolov |NNN12| of 3 badly 
colorable permutations can be used to define a 3-Partition instance with an optimum frac- 
tional solution x e {0, \} m such that any integral solution y e Z™ with supp(y) Q supp(x) and 
l T y-l T x< o(logn) satisfies maxi € [ n ]{Y.j<i(.AjX- Ajy)} > Q(logn). This suggests that either 
the log n bound is best possible for 3-Partition or some fundamentally new ideas are needed 
to make progress. 

Acknowledgements. The author is grateful to Michel X. Goemans for helpful discussions 
and support and to Nikhil Bansal for reading a preliminary version. 
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